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(57) ABSTRACT 

Devices are registered within a zone using a service discov- 
ery protocol. At the time of registration, a device sends a 
grammar and a set of help sequences to a control unit in the 
zone to be merged into a composite grammar and composite 
set of help sequences. If the voice command iddued by a user 
is recognized, an appropriate method invocation is sent to 
the device to perform the desired operation. If the device in 
the voice command is recognized but the operation is not 
recognized, then the user is provided with a help sequence 
on how to properly perform operations with the device. If 
the device is not recognized, the command is forwarded to 
another zone within a set of zones until either the device is 
found or no such device is located. 
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FIG. 2 
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FIG. 3 
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FIG. 4 
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FIG. 7 
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FIG. 8 
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METHOD AND SYSTEM FOR VOICE however, as well as a preferred mode of use, further objec- 

CONTROL ENABLING DEVICE IN A tives and advantages thereof, will best be understood by 

SERVICE DISCOVERY NETWORK reference to the following detailed description of an illus- 
trative embodiment when read in conjunction with the 

BACKGROUND OF THE INVENTION 5 accompanying drawings, wherein: 

1 Technical Field 1 * s a fl° or P^ an f° r a typical home where the 

Tlic present invention relates generally to improved auto- invention could be implemented through various zones of 

mation in the home and business and in particular to a control; 

method and a system for voice control of devices. Still more FIG. 2 shows a network of devices for the zones of control 

particularly, the present invention provides a method and a 10 from FIG. 1 in accordance with a preferred embodiment of 

system for plug-and-play voice control capability exploiting the invention; 

service discovery mechanisms available in the network. p IG 3 ^ a b i ock diagram showing the major functional 

2. Description of the Related Art regions of an embedded microcontroller that could be used 

Voice control of devices has been investigated for some to implement the current invention; 

time and devices are marketed that can respond to simple ]S piG 4 shows me connectioD of a voice enabled toaster t0 

commands For example, a cell phone may have a voice {h& ^ Qetwork m tfae kitchen {q accordance ^ a 

controlled directory where telephone numbers are automati- ^ embodiment of the invemion; 

cally dialed by speaking the name of the person to be called. r t t , . , , 

Often these devices are quite simple and can only respond to FIG 5 shows a f bset of commands recognized by the 

a limited number of commands. Rarely can such devices 20 system controlling the toaster m accordance with a preferred 

provide help to guide the user through a complex sequence embodiment of the invention; 

of voice commands. FIG* 6 illustrates a complete scenario of using a voice 

Hie technology for voice control is fairly well understood control system in accordance with a preferred embodiment 

but can be very expensive for a sophisticated system of °^ tne i nvent i° n ; 

command and control that supports verbal interactions with 25 FIG. 7 illustrates an alternative scenario of using a voice 

the user. As a minimum, such a system would require a control system with prompt messages in accordance with a 

microphone, a voice recognition system, a control system preferred embodiment of the invention; 

for operations performed by voice command, a speech FIG. 8 shows a composite grammar after several appli- 

generation system, a speaker, and a microprocessor to coor- ances have been plugged into the kitchen zone of a home 

dinate the various activities. To embed this level of sophis- 30 network in accordance with a preferred embodiment of the 

tication into a simple household device, such as a invention; and 

microwave, or a simple business device, such as a dictation FIG. 9 illustrates some words that are unique to the toaster 

machine, would be prohibitively expensive. that have not already been included in the composite gram- 

Therefore, it would be advantageous to have voice ^ mar shown in FIG. 8 in accordance with a preferred embodi- 

enabled devices that provide sophisticated voice control, ment of the invention, 
that are capable of interacting with a user, that are plugged 

in and work immediately, yet devices that are affordable for DETAILED DESCRIPTION OF THE 

a typical consumer. PREFERRED EMBODIMENT 

SUMMARY OF THE INVENTION 40 w * tD reference now to the figures, and in particular with 

, , . , . . r j * reference to FIG. 1, a diagram is provided of a typical house 

Amethod and system provide control for a set of devices divided ^ fc cQntrol zQnes ^ fooms haye been 

through voice commands of a user. Devices are registered diyided {n[Q cmiTo] mms baged on ^ 

within a zone using a service discovery protocol. At the time Qf ^ rooms In ticular> ^ t 102 contains the 

of registration, a device sends a grammar and a set of help and ^ indudmg a bre akfast room and a 

sequences to a control unit in the zone to be merged into a d room ^ 2 m ^ ^ ^ ^ 3 

composite grammar and composite set of help sequences. 1Q6 ^ ^ master suite &rea ^ contains (he mastef 

Typically transmission is via a power line information bedroomj the master bathj a walk-in closet. Zone 4 108 

transfer protocol when the device is plugged in or connected, ^ ^ chndren afea and ^ bedrooms for children 

although any means of data communication will suffice. The and & bath ^ 5 uo ^ ^ fa area 

voice recognition and voice generation hardware and sort- 3U . . . ■ ■ ™^ * 

7 » 1 , |i ■% .... „ a If t . Although the vanous rooms in the zones shown in FIG. 1 

ware are shared between all devices within a zone. It the ? „ - , . , , . , 

..j, • ■ 1 nn n , were contiguous for ease of drawing, the logical zones in a 

voice command issued by a user is recognized, an appro- & ? ' B 

.1 j * . t „ tU ,„ t~ „ a ,r^ ".l. home arrangement need not be physically next to each other 

pnate method invocation is sent to the device to perioral ihe an^wiiVM^ ~ F j j 

desired operation. If the device in the voice command is ° r even 00 , the same * oor > 10 ,he <?" ° f a "ulti-floor house 

recognized but the operation is not recognized, then the user « l 01 < he ^7 I00m ml &\ be ^ered part of 

is provided with a help sequence on how to properly perform ^ ne 1 even th ° u S h * 1S , no n ^ *> ^ h a8 J k,wn - ^f 0 

operations with the device. If the device is not recognized, tbe ">« s ma y be more "^alaed A ™ showD - For exam P f le > 

the command is forwarded to another zone within a set of Zo™. 1 m W actuallv contain three separate zones, one for 

zones until either the device is found or no such device is f he , d ' mn 8 for the kitchen and one for the 

located. This voice command system can be used in multiple « breakfast «x,rn- Other zones may be decomposed ,n a 

zones of a residence, in multiple zones of a business, or in simUar manner dependmg on the placement of rmcrophones 

multiple zones of any "space" occupied by the user, such as and s P eakers - 

an automobile. w ' m reference now to FIG. 2, the zones of control shown 

in FIG. 1 are organized into a service discovery network 

BRIEF DESCRIPTION OF THE DRAWINGS 65 with a variety of a6vic6S shown in eac h zone. Each zone has 

The novel features believed characteristic of the invention its own local service discovery network, which is a normal 

are set forth in the appended claims. The invention itself, data communications network, augmented with software 
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implementing a service discovery protocol such as By sharing hardware and software it becomes feasible for 

Microsoft's Universal Plug and Play or Sun Microsystem's even simple, low-cost devices, such as the toaster shown in 

Jini. Each of these service discovery networks can be joined, Zone 1 devices 212 of FIG. 2, to be able to respond to voice 

forming a federated service discovery network covering the commands without containing all the hardware and software 

entire house. The zones from FIG. 1 become nodes in the 5 required for a stand-alone voice recognition and generation 

house network. Zones 1-5 correspond to nodes 202, 204, system. Voice generation is used to provide assistance to the 

206, 208, and 210, respectively. Each device or appliance is user m the evem a comman d i s not recognized or to guide 

connected to one of these nodes. These devices share one or the user m tne ^ of the system> Thi& voice recognition and 

more common microphones to pick up voice commands. generation is performed in conjunction with a service dis- 

Sorne appliances are "universal", in the sense they are 10 covery network. The service discovery provides the dynam- 

common to every zone. Examples of "universal" devices are ics of devices entering and i eav ing the network, which, in 

lights, audio devices such as an intercom for piped in music turn ^ prov ides the events and data to update the command 

or room-to-room communications, climate control devices, and control grarnrnar f or devices in a given zone. 

such as heating and air conditioning, and security devices. refcrence qow tQ m a m shows ^ 

Other devices associated t with Zones 1-5 are shown in 15 connectioQ of a toastef t0 tne loca , net work in the kitchen. 

groups 212, 214, 216, 218, and 220, respectively. These ™ , AM . . . . 

b F 9 , i j ■ . Toaster 402 contains a microcontroller, such as the micro - 

zones are interconnected and tor some devices, such as the . n . . mr > , 1t ■ „„„ a , tUnt „ „ . 

. . . controller shown in HG, 3. It is assumed that connection to 

security system, may have a central control, such as control t , # , . . . , . , u # 

J J ' J the network is via the power cord, so the user simply has to 

center s own. in the toaster and signals are sent to and from the toaster 
This invention allows control of devices through voice 20 . lhe same lme that connects poweL ^ p0W erline- 
commands where the command and control hardware and based daU communications protocols X.10 is an example 
software are shared amongst the individual devices. The cd that can use a powerline for communications. A 
service discovery mechanism locates each device. Since all discovery prot0 col, such as Jini, Universal Plug and Play, or 
devices shown require electrical power, when a device is Sefvice Location Protocol, causes toaster 402 to be regis- 
plugged in, it can be automatically connected to the network 25 tefed witfa Sefvice Disc0 very Protocol Registry 404. 
through some protocol, such as Sun Microsystems* Jim. , , . , . . , . , , 1 • 
Other commonly occurring discover protocols are "Univer- ™ e local " elwo * I n the ta ' chen con . a,ns ^ 
sal Plug and Play" by Microsoft and Service Location 406 ' aerophone 408, speech recogmt.on system 410, 
Protocol from the Internet Engineering Task Force (IETF). s P eech generation system 412, and speaker 414. Th.s hard- 
Some devices are likely to remain in a specific zone, such as 30 Wa * «>f'^ -s shared by all of the devices in the 
the refrigerator in Zone 1 202, while other devices are res P° ndln g to VO f, TTf 
portable and might be moved from zone-to-zone, such as the mea ^ ,ha ' each <! evice » M f fo ^ d , to d "P hc '! e thes ^ 
video game controller shown in Zone 4 208. Every device facllltics ' thus making each device s.mpler and within a cost 
connected to the network will have to contain a microcon- ran 8 e acce P ,able «° consumers. 

trailer and software in order to participate in the service 35 Although a preferred embodiment of the invention uses 

discovery protocol and register tbe device when connected lhe P ower line for communications, as one of ordinary skill 

to the network. The device must be able to receive (and the art will appreciate, a variety of wired and wireless 

respond to) commands or messages sent over the data communications media are possible. Wired connections 

communications network as a result of voice commands include, but are not limited to, a power line based system, a 

issued by a user. 40 universal serial bus (USB), a firewire (IEEE 1394), and a 

With reference now to FIG, 3, a simple layout diagram proprietary wiring system. Wireless communications 
shows the major functional regions of an embedded micro- include ' but are not hmiled to r an infn * red data transfer, a 
controller. The heart of a microcontroller is microcontrol 900 MHz cordless P hone transfer > a 2 4 GHz cordless P hone 
(juC) logic 302 and, optionally, logic for digital signal transfer » and a Propnetary wireless system, 
processing (DSP). Programs and data are stored in random 45 witn reference now to FIG. 5, a subset of commands 
access memory 304, such as DRAM (dynamic random recognized by the system controlling the toaster is shown, 
access memory) or SRAM (static random access memory). The left hand column shows the recognized command 
If a program component never changes, it can be stored in sequences in Java Speech Markup Language (JSML). Asso- 
read only memory (ROM) 306, Some variations of ROM are ciated with each recognized command sequence is a Java 
one-time programmable (OTP) memory or FLASH memory. 50 method that can be invoked to cause the device to function 
FLASH memory can be written in blocks but cannot be as commanded. For example, the command sequence gen- 
written a byte at a time, like random access memory. Finally, erated by "toaster pop up" results in the command and 
an embedded microcontroller must have an interface to control software, working with the service discovery 
peripheral devices. This may take the form of application register, invoking the method "popUp( )". This method 
specific integrated circuit (ASIC) 308. For this invention, it 55 invocation is sent to the device and the device must be 
is important that the microcontroller have network access capable of carrying out the appropriate operation either 
capability. electronically or mechanically. For example, the "popup( )" 

Controlling a device through voice commands is not new. method at the toaster would cause the carrying tray in the 

For example, some electronic telephones and some cell toaster to be raised and the power to the heating unit to be 

phones allow the user to dial a number simply by speaking 60 removed. Each voice enabled device in a network would 

the name of the person to be called. What makes this need appropriate means to carry out the operations specified 

invention unique is that devices can share speech recogni- by voice commands. 

tion and speech generation hardware and software through As one of ordinary skill in the art will appreciate, markup 

network connections, the devices are added to a network of language protocols can be different than JSML and the 

devices using a service discovery protocol, and the grammar 65 programming language implementing the methods can be 

and help sequences for all devices in a zone are merged into different than Java. It is important to note that alternative 

a composite grammar and composite set of help sequences. command forms might trigger the same operation. For 
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example, the command "toaster increase temperature" allows the system to guide the correct response pattern. This 

causes the "incrementTemperature( )" method to be is illustrated in FIG. 7. 

invoked. The command "toaster increase darkness" would Th e kitchen zone, 202 in FIG. 2, is illustrated with three 
also cause the "incrementTemperature( )" method to be devices: toaster 702, oven 704, and microwave 706. Micro- 
invoked. The same flexibility in vocabulary could be used to 5 phone 708 receives user commands and speaker 710 uses 
make the system multilingual, such as English/Spanish/ digitized voice to communicate with human user 712. A 
French. hypothetical dialog is shown with human user commands 
This invention can be used in a speaker independent 714 and system responses 716. The device being controlled 
environment similar to that found in automatic telephone is microwave 706; this device has many more commands 
answering systems where any user can speak simple com- 10 than a toaster. User 712 has elected to have the system 
mands without training the system. For commands within a prompt for responses. 

restricted vocabulary, such systems can achieve an accept- The human user says "microwave" and then pauses. The 
able rate of correct recognition. This invention can also be system detects this pause and prompts the user by respond- 
used with systems that provide training to recognize voice ing with "Normal, defrost, or special?" The user responds 
commands from individual users in order to increase the 15 "defrost" and then pauses to be prompted again. The remain- 
accuracy of the voice recognition system. Tnis may be ing dialog shown in 714 and 716 is self-evident. If human 
desirable in a home environment where the same residents user 712 is familiar with this command sequence, then she 
will be using the system continually on a long term basis. could speak the commands directly without pauses as 
FIG. 5 illustrates the correspondence between a sequence "microwave defrost later five fifteen PM three point two 
of commands recognized by speech recognition system 408 20 pounds." 

in FIG. 4 and a method invoked in the target device to cause The dialogs illustrated in FIGS. 6 and 7 are for illustrative 
the action to be performed. With reference now to FIG. 6, an purposes only. The devices, vocabulary, and format of 
action diagram embeds this association into a complete commands can vary. A simple command system might 
scenario that illustrates a typical use of the voice control require a device name followed by an operation, such as 
system. Objects 600 include, from left to right, the registry, 25 "toaster pop down." A more sophisticated language under- 
time toaster, the speaker, the speech generation system, the standing system may be able to parse a natural language 
local network control, the speech recognition system, and sentence, such as "Please start the toaster*', and determine 
the microphone, corresponding to objects 404, 402, 414, this would invoke the method "popDown( )" on the toaster. 
412, 406, 410, and 408 in FIG. 4, respectively. The human ^ i n a similar manner, help sequences may vary in sophis- 
user, shown to the right, is also an object or actor in this tication. In a simple embodiment, all help sequences may be 
scenario. generated from the grammar sequences transferred from the 
The human user first plugs in the toaster (action 602). device to the control logic at registration. For example, as 
When the toaster detects it has been connected to a service shown in FIG. 5, the grammar sequences "Toaster pop up", 
discovery network, it immediately begins participating in 35 "Toaster pop down", "Toaster increase temperature", 
the service discovery protocol by sending out signals for "Toaster decrease darkness", and "Toaster help" have been 
registration. A discovery protocol causes the toaster to be registered. If the user says "toaster help", then the control 
registered with the Service Discovery Protocol Registry logic could generate the help message: "Please select one of 
(action 604). Since the toaster's registration contains the the following operations — pop, increase, or decrease." If the 
grammar sequences of recognized words, this information is 4Q user says "pop", then the help might continue: "up or 
installed with the speech recognition module at the time of down?" These simple help sequences would only depend on 
registration (step 606). There may also be help commands the grammar sequences transferred to the network at the 
and control scripts which are, registered with the speech time of registration. As one of ordinary skill in the art will 
generation unit (step 508). In FIG. 6 we assume the regis- appreciate, FIG. 5 only presents a subset of possible com- 
tration is successful. A . mands for a toaster. 

45 

The human then speaks the command "toaster begin" With a more sophisticated system, for instance, a navi- 

(action 610). The analog signal at the microphone is con- gation system in an automobile, specialized help sequences 

verted to a digital signal and sent to the speech recognition can be transferred from the device to the controller at the 

system (action 612). The speech recognition system identi- time of registration. The voice enabled devices based on a 

fies the device "toaster" but fails to recognize the operation 50 service discovery network described in this invention can be 

"begin" (action 614). Since the operation was not adapted to a wide variety of help scenarios, 

recognized, a help routine for the object toaster is sent to the Returning to the three cooking devices illustrated in FIG. 

speech generation system (action 616). This message is 7, it is possible that many of the words in the vocabulary 

converted from digital to analog and sent to the speaker ma y be similar (e.g., "temperature", "increase", "decrease") 

(action 618). The human user listens to these instructions 55 ana - omer words may be unique to a particular device (e.g., 

(action 620). "pop" for the toaster). Each device has its own grammar 

The human now knows the proper command and speaks sequences and corresponding device commands and this 

"toaster pop down" (action 622). This analog signal is information is stored in the microcontroller, such as the 

digitized (action 624) and all three words are recognized by microcontroller shown in FIG. 3. At the time the device is 

the speech recognition system (action 626). This caused the registered, this information is sent to speech recognition 

"popDown( )" method invocation to be sent to the toaster system 410 in FIG. 4. In other words, the information from 

(action 628). Action 628 corresponds to the second line of each device in the zone is merged into a composite set of 

JSML to method invocation pairs shown in FIG. 5. grammar sequences and corresponding device commands 

FIG. 6 illustrates a human user trying to issue a voice used by the speech recognition and generation systems. The 

command directly to a device. The command was not 65 aggregate grammar controls all the devices in the zone, 

recognized so the system provided instructions. The current If a device is removed from the network, the information 

invention can be used in an alternative mode where the user unique to that device is removed from the composite infor- 
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matioQ. Detection of device removal depends on the service Although this invention has been described in the context 

discovery protocol that is being used. For sophisticated of a home environment, the same system and method can be 

devices with elaborate shutdown procedures, such as found used in a variety of different environments, such as an office 

on personal computers, commands could be sent to the environment, and a variety of different devices, such as a 

central controller instructing that the device be removed 5 copy machine or a fax machine. The preferred embodiment 

from the network. Simpler devices, such as a toaster, might of the invention uses power cords to transmit signals to and 

be unplugged before any removal signal is sent to the from the network, but other media could be used, including 

controller. In this case, if a command is sent to the device wireless media. 

and the device does not respond, then the controller could It is important to note that, while the present invention has 

search for the device in the zone and, if not found, in other been described in the context of a fully functioning data 

zones. If the device is not found anyway, the user would be processing system, those of ordinary skill in the art will 

informed the device is no longer on the network and the appreciate that the processes of the present invention are 

information uniquely related to this device would be capable of being distributed in the form of a computer 

removed from the composite information. readable medium of instructions and a variety of forms, and 

Many service discovery protocols contain "self-healing" 1S that the P resent invention applies equally regardless of the 

features that can detect devices that no longer communicate particular type of signal bearing media actually used to carry 

with the network and remove their entries from the registry. out the distribution. Examples of computer readable media 

Jini, for example, uses a leasing mechanism. When the include recordable-type media such a floppy disc, a hard 

registry can no longer renew a lease with a device, the disk drive > a and CD-ROMs and transmission -type 

registry assumes the device is no longer connected to the 20 media such ™ and analo S communications links, 

network and purges the record of the device from its Tn e description of the present invention has been pre- 

database. sented for purposes of illustration and description but is not 

With reference now to FIG. 8, a composite grammar is intended to be exhaustive or limited to the invention in the 

shown after several appliances have been plugged into the form Closed. Many modifications and variations will be 

kitchen zone of a home network. For ease of understanding, 25 apparent to those of ordinary skill in the art. For example, in 

these words have been grouped into various word types. the automotive industry many devices, such as audio system, 

However, as one of ordinary skill in the art will appreciate, navigation systems, and communication systems, are added 

inside the system the words may be organized in a different or modified at the importer or dealership level. Each of these 

manner, such as alphabetic order. It is assumed that the devices would benefit greatly from a voice command system 

toaster, shown as object 402 in FIG. 4, has not been plugged 30 but it is not known at the time of manufacture which devices 

^ vet the user will select. With the plug-and-play flexibility of a 

Hie' toaster will contain within its own microcontroller the xrvi<x dis « )ve ?' Detwork > the voice con^ol facilities will 

grammar sequences it will recognize and the corresponding automatically adapt to new commands and help sequences 

device commands. It is possible to store help messages to as these voice control devices are added to the automobile 

guide the user. The local network controller has to translate 35 ™e embodiment was chosen and described in order to bes 

the command sequences into method invocations, as shown ex P am }** «ples of the invention and the practical 

in FIG. 5. All of the words in a device's vocabulary are sent application, and to enable others of ordinary -skill in the art 

to the voice recognition and voice generation systems at the t0 ^erstand the invention for various embodiments with 

time the device is registered, as shown in steps 606 and 608 vanous modifications as are suited to the particular use 

in FIG. 6. FIG. 9 illustrates some words that may be unique 40 ^emplated. 

to the toaster that have not already been included in the ^ * c alme 

composite grammar. Of course, the "unique" words will L Method m a drtnbnted data processing system for 

depend on the words already stored in the composite gram- controlling, through voice commands, a device located in 

mar. If this is the first device to be connected and that one m ™ ° f a P lurallt y of zones " tne m6tnod uprising the 

grammar is empty, then every word associated with the 45 Ste ^ S 0 ' 

toaster is "unique" and will be added to the composite registering the device with the distributed data processing 

grammar. Alternatively, there may be some words built into svstem at a ihG device 15 lirlked t0 the ^tributed 

the base system, such as spoken numbers, that are recog- data P rocess ing system; 

nized before any individual devices are plugged in. The receiving a set of grammar sequences and a corresponding 

words shown in FIGS. 4, 8, and 9 are for illustrative 50 set ° f d ^ v ice commands from the device at a time of 

purposes only and are not intended to limit the scope of the registration; 

disclosed invention. receiving a voice command for the device in the distrib- 

The zones shown in FIG. 2 can be connected to form a uted data processing system; 

federation of zones. In this manner, it would be possible to converting the voice command into a grammar sequence; 

control a device in one zone while actually being present in 55 and 

another zone. For example, suppose the user wants to open if the grammar sequence matches a grammar sequence in 

the garage door but is currently located in the master the set of grammar sequences, issuing a corresponding 

bedroom. The correct command sequence might be "garage device command to the device, 

door open". When the speech recognition system in Zone 3 2. The method of claim 1, wherein the set of grammar 

for the master suite does not recognize the device "garage 60 sequences and the corresponding set of device commands 

door", it can query the other zones in the house to see if the are received from the device via a wired connection by one 

device is recognized. Zone 5 would acknowledge a device of a power line based systems, a universal serial bus, a 

named "garage door*' and would be able to carry out the firewire, and a proprietary wiring system, 

operation "open". This federated network could either be 3. The method of claim 1, wherein the corresponding 

organized into a ring of zones with information being passed 65 device command is sent to the device via a wired connection 

around the ring or it could be organized in a hierarchy with by one of a power line based system, a universal serial bus, 

a central control at the house level. a firewire, and a proprietary wiring system. 
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4. The method or claim 1, wherein the set of grammar 
sequences and the corresponding set of device commands 
are received from the device via a wireless connection by 
one of an infrared data transfer, a 900 MHz cordless phone 
transfer, a 2.4 GHz cordless phone transfer, and a proprietary 
wireless system. 

5. The method of claim 1, wherein the corresponding 
device command is sent to the device via a wireless con- 
nection by one of an infrared data transfer, a 900 MHz 
cordless phone transfer, a 2.4 GHz cordless phone transfer, 
and a proprietary wireless system. 

6. The method of claim 1, wherein the device is registered 
via a service discovery protocol. 

7. The method of claim 1, wherein the grammar sequences 
and the corresponding device commands received from the 
device are merged into a composite set of grammar 
sequences and corresponding device commands based on 
grammar sequence and corresponding device commands 
received from a plurality of device in the zone. 

8. The method of claim 7, wherein when a link between 
the device and the distributed data processing system is no 
longer established, the grammar sequences and the corre- 
sponding device commands uniquely associated with the 
device are removed from the composite set of grammar 
sequences and corresponding device commands. 

9. The method of claim 1, wherein the distributed data 
processing system includes voice recognition hardware and 
software for a zone in the plurality of zones and wherein the 
voice recognition hardware and software is shared by a 
plurality of devices in the zone. 

10. The method of claim 1, wherein the distributed data 
processing system includes voice generation hardware and 
software for a zone in the plurality of zones and wherein the 
voice generation hardware and software is shared by a 
plurality of devices in the zone. 

11. The method of claim 1, wherein the converting a voice 
command into a grammar sequence includes determining if 
a device name in a grammar sequence is recognized and, if 
the device name is not recognized, the grammar sequence is 
forwarded to a second zone from the plurality of zones. 

12. The method of claim 1, wherein converting a voice 
command into a grammar sequence includes determining in 
a device name in a grammar sequence is recognized and, if 
the device name is recognized but the grammar sequence is 
incomplete, help messages are output to provide the alter- 
native choices available to complete the voice command. 

13. A distributed data processing system for controlling, 
through voice commands, a device located in one zone of a 
plurality of zones, the distributed data processing system 
comprising: 

registering means for registering the device with the 
distributed data processing system at a time the device 
is linked to the distributed data processing system: 

receiving means for receiving a set of grammar sequences 
and a corresponding set of device commands from the 
device at a lime of registration; 

detecting means for detecting a voice command for the 
device in the distributed data processing system; 

converting means for converting the voice command into 
a grammar sequence; and 

if the grammar sequence matches a grammar sequence in 
the set of grammar sequences, a issuing means for 
issuing a corresponding device command to the device. 

14. The distributed data processing system of claim 13, 
wherein the set of grammar sequences and the correspond- 
ing set of device commands are received from the device via 
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a wired connection by one of a power line based system, a 
universal serial bus, a firewire, and a proprietary wiring 
system. 

15. The distributed data processing system of claim 13, 
5 wherein the corresponding device command is sent to the 

device via a wired connection by one of a power line based 
system, a universal serial bus, and firewire, and a proprietary 
wiring system. 

16. The distributed data processing system of claim 13, 
10 wherein the set of grammar sequences and the correspond- 
ing set of device commands are received from the device via 
a wireless connection by one of an infrared data transfer, a 
900 MHz cordless phone transfer, a 2.4 GHz cordless phone 
transfer, and a proprietary wireless system. 

15 17. The distributed data processing system of claim 13, 
wherein the corresponding device command is sent to the 
device via a wireless connection by one of an infrared data 
transfer, a 900 MHz cordless phone transfer, a 2.4 GHz 
cordless phone transfer, and a proprietary wireless system. 

20 18. The distribute data processing system of claim 13, 
wherein the registering means is a service discovery proto- 
col. 

19. The distributed data processing system of claim 13, 
wherein the grammar sequences and the corresponding 

25 device commands received from the device are merged into 
a composite set of grammar sequences and corresponding 
device commands based on grammar sequences and corre- 
sponding device commands received from a plurality of 
devices in the zone. 

30 20. The distributed data processing system of claim 19, 
wherein when a link between the device and the distributed 
data processing system is no longer established, the gram- 
mar sequences and the corresponding device commands 
uniquely associated with the device are removed from the 

35 composite set of grammar sequences and corresponding 
device commands. 

21. The distributed data processing system of claim 13, 
wherein voice recognition hardware and software for a zone 
in the plurality of zones are shared by a plurality of devices 

40 in the zone. 

22. The distributed data processing system of claim 13, 
wherein voice generation hardware and software for in zone 
in the plurality of zones are shared by a plurality or devices 
in the zone. 

45 23. The distributed data processing system of claim 13, 
wherein the converting means for converting a voice com- 
mand into a grammar sequence includes determining, if a 
device name in a grammar sequence is recognized and, if the 
device name is not recognized, the grammar sequence is 

50 forwarded to a second zone from the plurality of zones. 

24. The distributed data processing system of claim 13, 
wherein the converting means for converting a voice com- 
mand into a grammar sequence includes determining if a 
device name in a grammar sequence is recognized and, if the 

55 device name is recognized but the grammar sequence is 
incomplete, help messages are output to provide the alter- 
native choices available to complete the voice command. 

25, A computer program product on a computer- real able 
medium for controlling, through voice commands, a device 

60 located in one zone of a plurality of zones, the computer 
program product comprising; 

instructions for registering the device will the distributed 
data processing system at a time the device is linked to 
tile distributed data processing system; 
65 instructions for receiving a set of grammar sequences and 
a corresponding set of device commands from the 
device at a time of registration; 
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instructions for detecting an voice command for the 
device in the distributed data processing system; 

instructions for converting the voice command into a 
grammar sequence; and 

if the grammar sequence matches a grammar sequence in 
the set of grammar sequences, instruction for issuing a 
corresponding device command to the device. 

26. The computer program product of claim 25, wherein 
the set of grammar sequences and the corresponding set of 
device commands are received from the device via a wired 
connection by one of a power line based system, a universal 
serial bus, a firewire, and a proprietary wiring system. 

27. The computer programing product of claim 25, 
wherein the corresponding device command is sent to the 
device via a wired connection by one of a power line based 
system, a universal serial bus, a firewire, until a proprietary 
wiring system. 

28. The computer program product of claim 25, wherein 
the set of grammar sequences and the corresponding set of 
device commands are received from the device via a wire- 
less connection by one of an infrared data transfer, a 900 
MHz cordless phone transfer, a 2.4 GHz cordless phone 
transfer, and a proprietary wireless system. 

29. The computer program product of claim 25, wherein 
the corresponding device command is sent to the device via 
a wireless connection by one of an infrared data transfer, a 
900 MHz cordless phone transfer, a 2.4 GHz cordless phone 
transfer, and a proprietary wireless system. 

30. The computer program product of claim 25, wherein 
the instructions for registering the device are a service 
discovery protocol. 

31. The computer program product of claim 25, wherein 
instructions merger the grammar sequences and the corre- 
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sponding device commands received from the device into a 
composite set of grammar sequence and corresponding 
device command based on grammar sequences and corre- 
sponding device commands received from a plurality of 
5 devices in the zone. 

32. The computer program product of claim 31, wherein 
when a link between the device and the distributed date 
processing system is no longer established, instructions 
remove the grammar sequences and the corresponding 

10 device commands uniquely associated with the device from 
the composite set of grammar sequences and corresponding 
device commands. 

33. The computer program product of claim 31, wherein 
instructions for voice recognition for a zone in the plurality 

15 of zones are shared by a plurality of devices in the zone. 

34. The computer program product of claim 31, wherein 
instructions for voice generation for a zone in the plurality 
of zones are shared by a plurality of devices in the zone. 

35. The computer program product of claim 31, wherein 
20 instructions for converting a voice command into a grammar 

sequence includes determining if a device name in a gram- 
mar sequence is recognized and, if the device name is not 
recognize, the grammar sequence is forwarded to a second 
zone from the plurality of zones. 

25 36. The computer program product of claim 31, wherein 
instructions for converting a voice command into a grammar 
sequence includes determining if a device name in a gram- 
mar sequence is recognized and, if the device name is 
recognized but the grammar sequence is incomplete, help 

30 messages are output to provide the alternative choices 
available to complete the voice command. 

* * * * * 



05/11/2004, EAST Version: 1.4.1 



